SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Oon, Chong Jin
Lim, Gek Keow
Leong, Ai Lin
Zhao, Yi
Chen, Wei Ning

(ii) TITLE OF INVENTION: A VACCINE-INDUCED HEPATITIS B VIRAL
STRAIN AND USES THEREOF

(iii) NUMBER OF SEQUENCES: 11

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Ladas & Parry 

(B) STREET: 26 West 61 Street 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA

(F) ZIP: 10023

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/719,533
(B) FILING DATE: 13-DEC-2000
(C) CLASSIFICATION: 435

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: PCT/SG98/00045

(B) FILING DATE: 19-JUN-1998

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Mass, Clifford J.

(B) REGISTRATION NUMBER: 30,086

(C) REFERENCE/DOCKET NUMBER: U-013108-9

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 708-1890

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3215 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CTCCACCACT TTCCACCAAA CTCTTCAAGA TCCCAGAGTC AGGGCCCTGT ACTTTCCTGC 60

TGGTGGCTCC AGTTCAGGAA CAGTGAGCCC TGCTCAGAAT ACTGTCTCTG CCATATCGTC 120

AATCTTATCG AAGACTGGGG ACCCTGTACC GAACATGGAG AACATCGGAT CAGGACTCCT 180

AGGACCCCTG CTCGTGTTAC AGGCGGGGTT TTTCTTGTTG ACAAAAATCC TCACAATACC 240

GCAGAGTCTA GACTCGTGGT GGACTTCTCT CAATTTTCTA GGGGGAACAC CCGTGTGTCT 300

TGGCCAAAAT TCGCAGTCCC AAATCTCCAG TCACTCACCA ACCTGTTGTC CTCGAATTTG 360

TCCTGGTTAT CGCTGGATGT GTCTGCGGCG TTTTATCATC TTCCTCTGCA TCCTGCTGCT 420

ATGCCTCATC TTGTTGTTGG TTCTTCTGGA CTATCAAGGT ATGTTGCCCG TTTGTCCTCT 480

AATTCCAGGA TCAACAACAA CCAGCACCGG ACCATGCAAA ACCTGCACAA CTCCTGCTCA 540

AGGAACCTCT ATGTTTCCCT CATGTTGCTG TACAAAACCT ACGGACAGAA ACTGCACCTG 600

TATTCCCATC CGATCATCTT GGGCTTTCGC AAAATACCTA TGGGAGTGGG CCTCAGTCCG 660

TTTCTCTTGG CTCAGTTTAC TAGTGCCATT TGTTCAGTGG TTCGTAGGGC TTTCCCCCAC 720

TGTCTGGCTT TCAGTTATAT GGATGATGTG GTTTTGGGGG CCAAGTCTGT ACAACATCTT 780

GAGTCCCTTT ATGCCGCTGT TACCAATTTT CTTTTGTCTT TGGGTATACA TTTAAACCCT 840

GACAAAACAA AAAGATGGGG ATATTCCCTT AACTTCATGG GATATGTCAT TGGGAGTTGG 900

GGCACATTGC CACAGGAACA TATTGTACAA AAAATCAAAA TGTGTTTTAG GAAACTTCCT 960

GTAAACAGGC CTATTGATTG GAAAGTATGT CAACGAATTG TGGGTCTTTT GGGGTTTGCC 1020

GCCCCTTTCA CGCAATGTGG ATATCCTGCT TTAATGCCTT TATATGCATG TATACAAGCA 1080

AAACAGGCTT TTACTTTCTC GCAAACTTAC AAGACCTTTC TAAGTAAACA GTATCTGAAC 1140

GTTTACCCCG TTGCTGGGCA ACGCCCTGGT CTGTGCCAAG TGTTTGCTGA CGCAACCCCC 1200

ACTGGTTGGG GGTTGGCCAT AGGCCATCAG CGCATGCGTG GAAGCTTTGT GTCTCCTGTG 1260

GCGATCCATA CTGCGGAACT CCTAGCCGCT TGTTTTGCTC GCAGCAGGTC TGGGGCAAAA 1320

CTCATCGGGA CTGAGAATTC TGTCGTGCTC TCCCGCAAGT ATACATCATT TCCATGGGTG 1380

GTAGGCTGTG CTGCCAACTG GATCCTGCGC GGGACGTCCT TTGTTTACGT CCCGTCGGGG 1440

GTGAATCCCG CGGACGACCC CTGCCGGGGC CGCTTGGGGC TCTACCGCCC GCTTCTCCGC 1500

CTGTTATACC GACCGACCAC GGGGCGCACC TCTCTTTACG CGGACTCCCC GTCTGTGCGT 1560

TCTCATCTGC CGGACCGTGT GCACTTCGCT TCACCTCTGC ACGTCGCATG GAGACCACCG 1620

TGAACGCCCA CGGGAACCTG CCCAAGGTCT TGCATAAGAG GACTCTTGGA CTTTCAGCAA 1680

TGTCAACGAC CGACCTTGAG GCATACTTCA AAGACTGTGT GTTTAATGAG TGGGAGGAGT 1740

TGGGGGAGGA GGTTAGGTTA AAGGTCTTTG TACTAGGAGG CTGTAGGCAT AAATTGGTGT 1800

GTTCACCATC AGCATGCAAC TTTTTCACCT CTGCCTAATC ATCTCATGTT CATGTCGTAC 1860

TGTTCAAGCC TCCAAGCTGT GCCTTGGGTG GCTTTGGGGC ATGGACATTG ACCCGTATAA 1920

AGAATTTGGA GCTTCTGTGG AGTTACTGTC TTTTTTGCCT TCTGACTTTT TTCCTTCTAT 1980

TCGAGATCTC CTGGACACCG CCTCTGCTCT GTATCGGGAG GCCTTAGAGT GTCCGGAACA 2040

TTGTTCACCT CACCATACGG CACTCAGGCA AGCTATTCTG AGTTGGGGTG AGTTAATGAA 2100

TGTAGCCACC TGGGTGGGAA GTAATTTGGA AGATCCAGCA TCCAGGGAAT TAGTAGTCAG 2160

CTATGTCAAC GTTAATATGG GCCTAAAAAT CAGACAACTA TTGTGGTTTC ACATTTCCTG 2220

TCTTACTTTT GGGAGAGAAA CTGTTCTTGA ATATTTGGTG TCTTTTGGAG TGTGGATTCG 2280

CACTCCTCCT GCATATAGAC CACCAAATGC CCCTATCTTA TCAACACTTC CGGAAACTAC 2340

TGTTGTTAGA CGAAGAGGCA GGTCCCCTAG AAGAAGAACT CCCTCGCCTC GCAGACGAAG 2400

GTCTCAATCG CCGCGTCGCA GAAGATCTCA ATCTCGGGAA TCTCAATGTT AGTATTCCTT 2460

GGACACATAA GGTGGGAAAC TTTACGGGGC TTTATTCTTC TAGGGTACCT TGCTTTAATC 2520

CTAAATGGCA AACTCCTTCT TTTCCGGACA TTCATTTGCA GGAGGACATT CTTGATAGAT 2580

GTAAGCAATT TGTGGGGCCC CTTACAGTAA ATGAAAACAG GAGACTAAAA TTAATTATGC 2640

2 IGGTAGGTT T FA 123 C AAA T GTTACTAAAT ATTTGCCCTT AGATAAAGGG ATCAAACCAT 2700

ATTATCCAGA GTATGTAGTT AATCATTACT TCCAGACGCG ACATTATTTA CACACTCTTT 2760

GGAAGGCGGG GATCTTATAT AAAAGAGAGT CCACACGTAG CGCCTCATTT TGCGGGTCAC 2820

CATATTCTTG GGAACAAGAT CTACAGCATG GGAGGTTGGT CTTCCAAACC TCGAAAAGGC 2880

ATGGGGACAA ACCTTTCTGT CCCCAATCCC CTGGGATTCT TCCCCGATCA TCAGTTGGAC 2940

CCTGCATTCA AAGCCAACTC AGAAAATCCA GATTGGGACC TCAACCCGCA CAAGGACAAG 3000

TGGCCGGACG GCAACAAGGT GGGAGTGGGA GCATTCGGGC CAGGGTTCAC CCCTCCTCAT 3060

GGGGGACTGT TGGGGTGGAG CGCTCAGGCT CAGGGCCTAC TCACAACTGT GCCAGCAGCT 3120

GCTCCTCCTG CCTCCACCAA TCGGCAGTCA GGAAGGCAGC CTACTCCCTT ATCTCCACCT 3180

CTAAGGGACA CTCATCCTCA GGCCATGCAG TGGAA 3215

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Pro Leu Ser Tyr Gln His Phe Arg Lys Leu Leu Leu Leu Asp Glu
1 5 10 15

Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu Ala Asp Glu Gly
20 25 30

Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly Asn Leu Asn Val
35 40 45

Ser Ile Pro Trp Thr His Lys Val Gly Asn Phe Thr Gly Leu Tyr Ser
50 55 60

Ser Thr Val Pro Cys Phe Asn Pro Lys Trp Gln Thr Pro Ser Phe Pro
65 70 75 80

Asp Ile His Leu Gln Glu Asp Ile Leu Asp Arg Cys Lys Gln Phe Val
85 90 95

Glu Pro Leu Thr Val Asn Glu Asn Arg Arg Leu Lys Leu Ile Met Pro
100 105 110

Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro Leu Asp Lys Gly
115 120 125

Ile Lys Pro Tyr Tyr Pro Glu Tyr Val Val Asn His Tyr Phe Gln Thr
130 135 140

Arg His Tyr Leu His Thr Leu Trp Lys Ala Gly Ile Leu Tyr Lys Arg
145 150 155 160

Glu Ser Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro Tyr Ser Trp Glu
165 170 175

Gln Asp Leu Gln His Gly Arg Leu Val Phe Gln Thr Ser Lys Arg His
180 185 190

Gly Asp Lys Ser Phe Cys Pro Glu Ser Pro Gly Ile Leu Pro Arg Ser
195 200 205

Ser Val Gly Pro Cys Ile Gln Ser Gln Leu Arg Lys Ser Arg Leu Gly
210 215 220

Pro Gln Pro Ala Gln Gly Gln Leu Ala Gly Arg Gln Gln Gly Gly Ser
225 230 235 240

Gly Ser Ile Arg Ala Arg Val His Pro Ser Ser Trp Gly Thr Val Gly
245 250 255

Val Glu Pro Ser Gly Ser Gly Pro Thr His Asn Cys Ala Ser Ser Ser
260 265 270

Ser Ser Cys Leu His Gln Ser Ala Val Arg Lys Ala Ala Tyr Ser Leu
275 280 285

Ile Ser Thr Ser Lys Gly His Ser Ser Ser Gly His Ala Val Glu Leu
290 295 300

His His Phe Pro Pro Asn Ser Ser Arg Ser Gln Ser Gln Gly Pro Val
305 310 315 320

Leu Ser Cys Trp Trp Leu Gln Phe Arg Asn Ser Glu Pro Cys Ser Glu
325 330 335

Tyr Cys Leu Cys His Ile Val Asn Leu Ile Glu Asp Trp Gly Pro Cys
340 345 350

Thr Glu His Gly Glu His Arg Ile Arg Thr Pro Arg Thr Pro Ala Arg
355 360 365

Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro His Asn Thr Ala
370 375 380

Glu Ser Arg Leu Val Val Asp Phe Ser Gln Phe Ser Arg Gly Asn Thr
385 390 395 400

Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu Gln Ser Leu Thr
405 410 415

Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu Asp Val Ser Ala
420 425 430

Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met Pro His Leu Leu
435 440 445

Val Gly Ser Ser Gly Leu Ser Arg Tyr Val Ala Arg Leu Ser Ser Asn
450 455 460

Ser Arg Ile Asn Asn Asn Glu His Arg Thr Met Glu Asn Leu His Asn
465 470 475 480

Ser Cys Ser Arg Asn Leu Tyr Val Ser Leu Met Leu Leu Tyr Lys Thr
485 490 495

Tyr Gly Gln Lys Leu His Leu Tyr Ser His Pro Ile Ile Leu Gly Phe
500 505 510

Arg Lys Ile Pro Met Gly Val Gly Leu Ser Pro Phe Leu Leu Ala Gln
515 520 525

Phe Thr Ser Ala Ile Cys Ser Val Val Arg Arg Ala Phe Pro His Cys
530 535 540

Leu Ala Phe Ser Tyr Met Asp Asp Val Val Leu Gly Ala Lys Ser Val
545 550 555 560

Gln His Leu Glu Ser Leu Tyr Ala Ala Val Thr Asn Phe Leu Leu Ser
565 570 575

Leu Gly Ile His Leu Asn Pro His Lys Thr Lys Arg Trp Gly Tyr Ser
580 585 590

Leu Asn Phe Met Gly Tyr Val Ile Gly Ser Trp Gly Thr Leu Pro Gln
595 600 605

Glu His Ile Val Gln Lys Ile Lys Met Cys Phe Arg Lys Leu Pro Val
610 615 620

Asn Arg Pro Ile Asp Trp Lys Val Cys Gln Arg Ile Val Gly Leu Leu
625 630 635 640

Gly Phe Ala Ala Pro Phe Thr Gln Cys Gly Tyr Pro Ala Leu Met Pro
645 650 655

Leu Tyr Ala Cys Ile Gln Ala Lys Gln Ala Phe Thr Phe Ser Gln Thr
660 665 670

Tyr Lys Thr Phe Leu Ser Lys Gln Tyr Leu Asn Leu Tyr Pro Val Ala
675 680 685

Arg Gln Arg Pro Gly Leu Cys Glu Val Phe Ala Asp Ala Thr Pro Thr
690 695 700

Gly Trp Gly Leu Ala Ile Gly His Gln Arg Met Arg Gly Thr Phe Val
705 710 715 720

Ser Pro Leu Pro Ile His Thr Ala Glu Leu Leu Ala Ala Cys Phe Ala
725 730 735

Arg Ser Arg Ser Gly Ala Lys Leu Ile Gly Thr Asp Asn Ser Val Val
740 745 750

Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu Gly Cys Ala Ala
755 760 765

Asn Trp Ile Leu Arg Gly Thr Ser Phe Val Tyr Val Pro Ser Ala Leu
770 775 780

Asn Pro Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly Leu Tyr Arg Pro
785 790 795 800

Leu Leu Arg Leu Leu Tyr Arg Pro Thr Thr Gly Arg Thr Ser Leu Tyr
805 810 815

Ala Asp Ser Pro Ser Val Pro Ser His Leu Pro Asp Arg Val His Phe
820 825 830

Ala Ser Pro Leu His Val Ala Trp Arg Pro Pro
835 840



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu
1 5 10 15

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gln Leu Asp Pro
20 25 30

Ala Phe Lys Ala Asn Ser Glu Asn Pro Asp Trp Asp Leu Asn Pro His
35 40 45

Lys Asp Asn Trp Pro Asp Ala Asn Lys Val Gly Val Gly Ala Phe Gly
50 55 60

Pro Gly Phe Thr Pro Pro His Gly Gly Leu Leu Gly Trp Ser Pro Gln
65 70 75 80

Ala Gln Gly Leu Leu Thr Thr Val Pro Ala Ala Pro Pro Pro Ala Ser
85 90 95

Thr Asn Arg Gln Ser Gly Arg Gln Pro Thr Pro Leu Ser Pro Pro Leu
100 105 110

Arg Asp Thr His Pro Gln Ala Met Gln Trp Asn Ser Thr Thr Phe His
115 120 125

Gln Thr Leu Gln Asp Pro Arg Val Arg Ala Leu Tyr Phe Pro Ala Gly
130 135 140

Gly Ser Ser Ser Gly Thr Val Ser Pro Ala Gln Asn Thr Val Ser Ala
145 150 155 160

Ile Ser Ser Ile Leu Ser Lys Thr Gly Asp Pro Val Pro Asn Met Glu
165 170 175

Asn Ile Ala Ser Gly Leu Leu Gly Pro Leu Leu Val Leu Gln Ala Gly
180 185 190

Phe Phe Leu Leu Thr Lys Ile Leu Thr Ile Pro Gln Ser Leu Asp Ser
195 200 205

Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Pro Thr Val Cys Leu Gly
210 215 220

Gln Asn Ser Gln Ser Gln Ile Ser Ser His Ser Pro Thr Cys Cys Pro
225 230 235 240

Pro Ile Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe Ile Ile
245 250 255

Phe Leu Cys Ile Leu Leu Leu Cys Leu Ile Phe Leu Leu Val Leu Leu
260 265 270

Asp Tyr Gln Gly Met Leu Pro Val Cys Pro Leu Ile Pro Gly Ser Thr
275 280 285

Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr Thr Pro Ala Gln Gly
290 295 300

Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Thr Asp Arg Asn
305 310 315 320

Cys Thr Cys Ile Pro Ile Pro Ser Ser Trp Ala Phe Ala Lys Tyr Leu
325 330 335

Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro
340 345 350

Phe Val Gln Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val
355 360 365

Ile Trp Met Met Trp Phe Trp Gly Pro Ser Leu Tyr Asn Ile Leu Ser
370 375 380

Pro Phe Met Pro Leu Leu Pro Ile Phe Phe Cys Leu Trp Val Tyr Ile
385 390 395 400



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 212 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Gln Leu Phe His Leu Cys Leu Ile Ile Ser Cys Ser Cys Pro Thr
1 5 10 15

Val Gln Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp Ile
20 25 30

Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser Phe Leu
35 40 45

Pro Ser Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp Thr Ala Ser
50 55 60

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His
65 70 75 80

His Thr Ala Leu Arg Gln Ala Ile Leu Ser Trp Gly Glu Leu Met Asn
85 90 95

Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser Arg Glu
100 105 110

Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys Ile Arg Gln
115 120 125

Leu Leu Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg Glu Thr Val
130 135 140

Leu Glu Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr Pro Pro Ala
145 150 155 160

Tyr Arg Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro Glu Thr Thr
165 170 175

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro
180 185 190

Arg Arg Arg Arg Ser Gln Ser Pro Arg Arg Arg Arg Ser Gln Ser Arg
195 200 205

Glu Ser Gln Cys
210



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 154 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Ala Ala Arg Leu Cys Cys Gln Leu Asp Pro Ala Arg Asp Val Leu
1 5 10 15

Cys Leu Arg Pro Val Gly Ala Glu Ser Arg Gly Arg Pro Leu Pro Gly
20 25 30

Pro Leu Gly Ala Leu Pro Pro Ala Ser Pro Pro Val Ile Pro Thr Asp
35 40 45

His Gly Ala His Leu Ser Leu Arg Gly Leu Pro Val Cys Ala Phe Ser
50 55 60

Ser Ala Gly Pro Cys Ala Leu Arg Phe Thr Ser Ala Arg Arg Met Glu
65 70 75 80

Thr Thr Val Asn Ala His Gly Asn Leu Pro Lys Val Leu His Lys Arg
85 90 95

Thr Leu Gly Leu Ser Ala Met Ser Thr Thr Asp Leu Glu Ala Tyr Phe
100 105 110

Lys Asp Cys Val Phe Asn Glu Trp Glu Glu Leu Gly Glu Glu Val Arg
115 120 125

Leu Lys Val Phe Val Leu Gly Gly Cys Arg His Lys Leu Val Cys Ser
130 135 140

Pro Ser Pro Cys Asn Phe Phe Thr Ser Ala
145 150



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
ATAAGCTTATG CCCCTATCTT ATCAACACTT CCGGA 35



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GAGTCTAGAC TCTGCGGTAT TGTGA



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GAGTCTAGAC TCGTGGTGGA CTTCT 25



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TGAGAATTCT CACGGTGGTC TCCATGCGAC GT 32



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

TTTGTTTACG TCCCGT



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:



CTAAGCTTAG TTTCCGGAAG TGTTGAT 
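
Each entry above declares a LENGTH that can be cross-checked against its sequence text. The following is a minimal sketch of such a check, assuming the sequence lines are transcribed as plain strings. The names count_residues, check_lengths and ENTRIES are hypothetical helpers introduced here for illustration; they are not part of the listing or of the PatentIn software. Only SEQ ID NO: 8, 9 and 11 are used as sample input.

```python
import re


def count_residues(sequence_text: str) -> int:
    """Count nucleotide letters, ignoring block spaces and position numbers."""
    return len(re.findall(r"[ACGT]", sequence_text.upper()))


# Sequence text and declared LENGTH, copied from the entries above.
ENTRIES = {
    8: ("GAGTCTAGAC TCGTGGTGGA CTTCT", 25),
    9: ("TGAGAATTCT CACGGTGGTC TCCATGCGAC GT", 32),
    11: ("CTAAGCTTAG TTTCCGGAAG TGTTGAT", 27),
}


def check_lengths(entries):
    """Map each SEQ ID NO to True when the counted length matches the declared one."""
    return {
        seq_id: count_residues(text) == declared
        for seq_id, (text, declared) in entries.items()
    }


if __name__ == "__main__":
    for seq_id, ok in check_lengths(ENTRIES).items():
        print(f"SEQ ID NO: {seq_id}: {'length matches' if ok else 'length mismatch'}")
```

Because the counter keeps only the letters A, C, G and T, a trailing position number on a sequence line does not affect the result. A mismatch would suggest a transcription error in either the sequence text or the LENGTH field.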